Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model
نویسندگان
چکیده
One-to-many eigenvoice conversion (EVC) allows the conversion of a specific source speaker into arbitrary target speakers. Eigenvoice Gaussian mixture model (EV-GMM) is trained in advance with multiple parallel data sets consisting of the source speaker and many pre-stored target speakers. The EV-GMM is adapted for arbitrary target speakers using only a few utterances by estimating a small number of free parameters. Therefore, the initial EV-GMM directly affects the conversion performance of the adapted EV-GMM. In order to prepare a better initial model, this paper proposes Speaker Adaptive Training (SAT) of a canonical EV-GMM in one-to-many EVC. Results of objective and subjective evaluations demonstrate that SAT causes significant improvements in the performance of EVC.
منابع مشابه
Effects of Speaker Adaptive Training on Tensor-based Arbitrary Speaker Conversion
This paper introduces speaker adaptive training techniques to tensor-based arbitrary speaker conversion. In voice conversion studies, realization of conversion from/to an arbitrary speaker’s voice is one of the important objectives. For this purpose, eigenvoice conversion (EVC), which is based on an eigenvoice Gaussian mixture model (EV-GMM), was proposed. Although the EVC can effectively const...
متن کاملMon.O1d.06 Effects of Speaker Adaptive Training on Tensor-based Arbitrary Speaker Conversion
This paper introduces speaker adaptive training techniques to tensor-based arbitrary speaker conversion. In voice conversion studies, realization of conversion from/to an arbitrary speaker’s voice is one of the important objectives. For this purpose, eigenvoice conversion (EVC), which is based on an eigenvoice Gaussian mixture model (EV-GMM), was proposed. Although the EVC can effectively const...
متن کاملAdaptive Training for Voice Conversion Based on Eigenvoices
In this paper, we describe a novel model training method for one-to-many eigenvoice conversion (EVC). One-to-many EVC is a technique for converting a specific source speaker’s voice into an arbitrary target speaker’s voice. An eigenvoice Gaussian mixture model (EVGMM) is trained in advance using multiple parallel data sets consisting of utterance-pairs of the source speaker and many pre-stored ...
متن کاملAn improved one-to-many eigenvoice conversion system
We have previously developed a one-to-many eigenvoice conversion (EVC) system enabling the conversion from a specific source speaker’s voice into an arbitrary target speaker’s voice. In this system, eigenvoice Gaussian mixture model (EV-GMM) is trained in advance with multiple parallel data sets composed of utterance pairs of the source and many pre-stored target speakers. The EV-GMM is effecti...
متن کاملOne-to-Many Voice Conversion Based on Tensor Representation of Speaker Space
This paper describes a novel approach to flexible control of speaker characteristics using tensor representation of speaker space. In voice conversion studies, realization of conversion from/to an arbitrary speaker’s voice is one of the important objectives. For this purpose, eigenvoice conversion (EVC) based on an eigenvoice Gaussian mixture model (EV-GMM) was proposed. In the EVC, similarly t...
متن کامل